skip to main content


Search for: All records

Creators/Authors contains: "Ning, Xia"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Retrosynthesis is a procedure where a target molecule is transformed into potential reactants and thus the synthesis routes can be identified. Recently, computational approaches have been developed to accelerate the design of synthesis routes. In this paper,we develop a generative framework G2Retro for one-step retrosynthesis prediction. G2Retro imitates the reversed logic of synthetic reactions. It first predicts the reaction centers in the target molecules (products), identifies the synthons needed to assemble the products, and transforms these synthons into reactants. G2Retro defines a comprehensive set of reaction center types, and learns from the molecular graphs of the products to predict potential reaction centers. To complete synthons into reactants, G2Retro considers all the involved synthon structures and the product structures to identify the optimal completion paths, and accordingly attaches small substructures sequentially to the synthons. Here we show that G2Retro is able to better predict the reactants for given products in the benchmark dataset than the state-of-the-art methods. 
    more » « less
    Free, publicly-accessible full text available May 30, 2024
  2. Abstract Motivation

    MHC Class I protein plays an important role in immunotherapy by presenting immunogenic peptides to anti-tumor immune cells. The repertoires of peptides for various MHC Class I proteins are distinct, which can be reflected by their diverse binding motifs. To characterize binding motifs for MHC Class I proteins, in vitro experiments have been conducted to screen peptides with high binding affinities to hundreds of given MHC Class I proteins. However, considering tens of thousands of known MHC Class I proteins, conducting in vitro experiments for extensive MHC proteins is infeasible, and thus a more efficient and scalable way to characterize binding motifs is needed.

    Results

    We presented a de novo generation framework, coined PepPPO, to characterize binding motif for any given MHC Class I proteins via generating repertoires of peptides presented by them. PepPPO leverages a reinforcement learning agent with a mutation policy to mutate random input peptides into positive presented ones. Using PepPPO, we characterized binding motifs for around 10 000 known human MHC Class I proteins with and without experimental data. These computed motifs demonstrated high similarities with those derived from experimental data. In addition, we found that the motifs could be used for the rapid screening of neoantigens at a much lower time cost than previous deep-learning methods.

    Availability and implementation

    The software can be found in https://github.com/minrq/pMHC.

    Supplementary information

    Supplementary data are available at Bioinformatics online.

     
    more » « less
  3. Abstract

    Effective and successful clinical trials are essential in developing new drugs and advancing new treatments. However, clinical trials are very expensive and easy to fail. The high cost and low success rate of clinical trials motivate research on inferring knowledge from existing clinical trials in innovative ways for designing future clinical trials. In this manuscript, we present our efforts on constructing the first publicly available Clinical Trials Knowledge Graph, denoted as$$\mathop {\mathtt {CTKG}}\limits$$CTKG.$$\mathop {\mathtt {CTKG}}\limits$$CTKGincludes nodes representing medical entities in clinical trials (e.g., studies, drugs and conditions), and edges representing the relations among these entities (e.g., drugs used in studies). Our embedding analysis demonstrates the potential utilities of$$\mathop {\mathtt {CTKG}}\limits$$CTKGin various applications such as drug repurposing and similarity search, among others.

     
    more » « less
  4. We propose an innovative machine learning paradigm enabling precision medicine for prioritizing cognitive assessments according to their relevance to Alzheimer's disease at the individual patient level. The paradigm tailors the cognitive biomarker discovery and cognitive assessment selection process to the brain morphometric characteristics of each individual patient. We implement this paradigm using a newly developed learning-to-rank method PLTR. Our empirical study on the ADNI data yields promising results to identify and prioritize individual-specific cognitive biomarkers as well as cognitive assessment tasks based on the individual's structural MRI data. The resulting top ranked cognitive biomarkers and assessment tasks have the potential to aid personalized diagnosis and disease subtyping. 
    more » « less
  5. Pharmacoinformatics research has experienced a great deal of successes in detecting drug‐induced adverse events (AEs) using large‐scale health record databases. In the era of polypharmacy, pharmacoinformatics faces many new challenges, and two significant challenges are to detect high‐order drug interactions and to handle strongly correlated drugs. In this article, we propose a super‐combo‐drug test (SupCD‐T) to address the aforementioned two challenges. SupCD‐T detects drug interactions by identifying optimal drug combinations with increased AE risks. In addition, SupCD‐T increases the statistical powers to detect single‐drug effects by combining strongly correlated drugs. Although SupCD‐T does not distinguish single‐drug effects from their combination effects, it is noticeably more powerful in selecting an individual drug effect in the multiple regression analysis, where confounding justification between two correlated drugs reduces the power in testing the individual drug effects on AEs. Our simulation studies demonstrate that SupCD‐T has generally better power comparing with the multiple regression analysis. In addition, SupCD‐T is able to select meaningful drug combinations (eg, highly coprescribed drugs). Using electronic health record database, we illustrate the utility of SupCD‐T and discover a number of drug combinations that have increased risk in myopathy. Some novel drug combinations have not yet been investigated and reported in the pharmacology research.

     
    more » « less
  6. Background

    While the pharmacokinetic (PK) mechanisms for many drug interactions (DDIs) have been established, pharmacovigilance studies related to these PK DDIs are limited. Using a large surveillance database, a translational informatics approach can systematically screen adverse drug events (ADEs) for many DDIs with known PK mechanisms.

    Methods

    We collected a set of substrates and inhibitors related to the cytochrome P450 (CYP) isoforms, as recommended by the United States Food and Drug Administration (FDA) and Drug Interactions Flockhart table™. The FDA's Adverse Events Reporting System (FAERS) was used to obtain ADE reports from 2004 to 2018. The substrate and inhibitor information were used to form PK DDI pairs for each of the CYP isoforms and Medical Dictionary for Regulatory Activities (MedDRA) preferred terms used for ADEs in FAERS. A shrinkage observed‐to‐expected ratio (Ω) analysis was performed to screen for potential PK DDI and ADE associations.

    Results

    We identified 149 CYP substrates and 62 CYP inhibitors from the FDA and Flockhart tables. Using FAERS data, only those DDI‐ADE associations were considered that met the disproportionality threshold ofΩ > 0 for a CYP substrate when paired with at least two inhibitors. In total, 590 ADEs were associated with 2085 PK DDI pairs and 38 individual substrates, with ADEs overlapping across different CYP substrates. More importantly, we were able to find clinical and experimental evidence for the paclitaxel‐clopidogrel interaction associated with peripheral neuropathy in our study.

    Conclusion

    In this study, we utilized a translational informatics approach to discover potentially novel CYP‐related substrate‐inhibitor and ADE associations using FAERS. Future clinical, population‐based and experimental studies are needed to confirm our findings.

     
    more » « less